Speech Emotion Recognition Based on Transfer Emotion-Discriminative Features Subspace Learning
نویسندگان
چکیده
Cross-corpus speech emotion recognition(SER) is a hot topic in classification. SER includes these four issues:feature selection, differences constraint, label regression and preservation of discriminative features. Seldom literature can solve issues jointly previous studies.In this work,we propose the transfer emotion-discriminative features subspace learning(TEDFSL) method.Acoustic are extracted by OpenSMILE source target data. Then sent into CNN+BLSTM to learn higher-level global time series. The common low-dimensional data learned Linear Discriminant analysis (LDA) reduce dimension Maximum Mean Discrepancy (MMD) Graph Embedding (GE) constraint between low- dimensional combined with matrix relationship labels features,after which the, DNN selected as final classifier preserve features, emotion-aware center loss( $\mathrm {l}_{\mathrm {c}}$ ) added extensive experiments carried out on cross-corpus tasks results demonstrate that our proposed method superior state-of-art SER.
منابع مشابه
Emotion Recognition from Speech using Discriminative Features
Creating an accurate Speech Emotion Recognition (SER) system depends on extracting features relevant to that of emotions from speech. In this paper, the features that are extracted from the speech samples include Mel Frequency Cepstral Coefficients (MFCC), energy, pitch, spectral flux, spectral roll-off and spectral stationarity. In order to avoid the 'curse of dimensionality', statis...
متن کاملFeature Transfer Learning for Speech Emotion Recognition
Speech Emotion Recognition (SER) has achieved some substantial progress in the past few decades since the dawn of emotion and speech research. In many aspects, various research efforts have been made in an attempt to achieve human-like emotion recognition performance in real-life settings. However, with the availability of speech data obtained from different devices and varied acquisition condi...
متن کاملSpeaker Emotion Recognition Based on Speech Features and Classification Techniques
Speech Processing has been developed as one of the vital provision region of Digital Signal Processing. Speaker recognition is the methodology of immediately distinguishing who is talking dependent upon special aspects held in discourse waves. This strategy makes it conceivable to utilize the speaker's voice to check their character and control access to administrations, for example voice diali...
متن کاملSpeech Emotion Recognition Using Scalogram Based Deep Structure
Speech Emotion Recognition (SER) is an important part of speech-based Human-Computer Interface (HCI) applications. Previous SER methods rely on the extraction of features and training an appropriate classifier. However, most of those features can be affected by emotionally irrelevant factors such as gender, speaking styles and environment. Here, an SER method has been proposed based on a concat...
متن کاملSpeech Emotion Recognition Considering Local Dynamic Features
Recently, increasing attention has been directed to the study of the speech emotion recognition, in which global acoustic features of an utterance are mostly used to eliminate the content differences. However, the expression of speech emotion is a dynamic process, which is reflected through dynamic durations, energies, and some other prosodic information when one speaks. In this paper, a novel ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3282982